Minimum Sample Risk Methods for Language Modeling1
نویسندگان
چکیده
This paper proposes a new discriminative training method, called minimum sample risk (MSR), of estimating parameters of language models for text input. While most existing discriminative training methods use a loss function that can be optimized easily but approaches only approximately to the objective of minimum error rate, MSR minimizes the training error directly using a heuristic training procedure. Evaluations on the task of Japanese text input show that MSR can handle a large number of features and training samples; it significantly outperforms a regular trigram model trained using maximum likelihood estimation, and it also outperforms the two widely applied discriminative methods, the boosting and the perceptron algorithms, by a small but statistically significant margin.
منابع مشابه
On the Design of Feedback Controllers for a Convecting Fluid Flow via Reduced Order Modeling1
Fluid Flow via Reduced Order Modeling1 John A. Burns, Belinda B. King Center for Optimal Design and Control Interdisciplinary Center for Applied Mathematics Virginia Polytechnic Institute and State University Blacksburg, VA 24061{0531 Diana Rubio Center for Research in Scienti c Computation North Carolina State University Raleigh, NC 27695{8205 Abstract In this paper, we study the e ect of mode...
متن کاملMinimum Sample Risk Methods for Language Modeling
This paper proposes a new discriminative training method, called minimum sample risk (MSR), of estimating parameters of language models for text input. While most existing discriminative training methods use a loss function that can be optimized easily but approaches only approximately to the objective of minimum error rate, MSR minimizes the training error directly using a heuristic training p...
متن کاملAssessment and treatment of childhood apraxia of speech: An inquiry into knowledge and experience of speech-language pathologists
Objectives: The present research aimed to identify the assessment and treatment processes implemented by Iranian speech-language pathologists (SLPs) for CAS and to investigate the possibility of impact of their knowledge level and years of experience on their choice of assessment and treatment. Methods: A cross-sectional method using survey design was employed to obtain a sample of 260 SLPs w...
متن کاملFinancial Engineering Estimation of Minimum Risk Hedge Ratio
In this paper, the financial engineering minimum risk-based portfolio hedging model is first analyzed. It is then followed by the investigation on various major estimation methods for the minimum risk hedge ratio. The results revealed in the current study show that the HR obtained by the ordinary least squares (OLS) model is maximal and the out-of-sample hedging performance is the best; however...
متن کامل